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We consider asymptotic problems in spectral analysis of sta- 
tionary causal processes. Limiting distributions of periodograms and 
smoothed periodogram spectral density estimates are obtained and 
applications to the spectral domain bootstrap are given. Instead of 
the commonly used strong mixing conditions, in our asymptotic spec- 
tral theory we impose conditions only involving (conditional) mo- 
ments, which are easily verifiable for a variety of nonlinear time se- 
ries. 



1. Introduction. The frequency domain approach to time series analysis 
is an important subject; see [1, 7, 29] and [52] among others. An asymptotic 
distribution theory is needed, for example, in hypothesis testing and in the 
construction of confidence intervals. However, most of the asymptotic results 
developed in the literature are for strong mixing processes and processes 
with quite restrictive summability conditions on joint cumulants [6, 7, 56, 
57]. Such conditions seem restrictive and they are not easily verifiable. For 
example, Andrews [2] showed that, for a simple autoregressive process with 
innovations being independent and identically distributed (i.i.d.) Bernoulli 
random variables, the process is not strong mixing. Other special processes 
discussed include Gaussian processes [60, 61] and linear processes [1]. 

There has been a recent surge of interest in nonlinear time series ([21, 53] 
and [65]). It seems that a systematic asymptotic spectral theory for such 
processes is lacking [11]. The primary goal of this paper is to establish an 
asymptotic spectral theory for stationary, causal processes. Let (en)nGZ be 
a sequence of i.i.d. random variables; let 
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where G is a measurable function such that Xn is a proper random variable. 
Then the process (X„,) is causal or nonanticipative in the sense that it only 

depends on ^„ = (. . . ,e„_i,e„), not on the future innovations £n+i-,Sn+2-, 

The class of processes within the framework of (1.1) is quite large (cf. [53, 
65, 66] and [74] among others). 

Assume throughout this paper that (X„)„gz has mean zero and finite 
covariance function r{k) = E(XoXfc), fc E Z. Let i = \/— 1 be the imaginary 
unit. If {Xn) is short-range dependent, namely 

oo 

(1.2) 5]|r(A;)|<oo, 

fc=0 

then the spectral density 

fcez 

is continuous and bounded. Given the observations Xi,. . . let 

n , 

Sn{e) = Y.Xke'^' and /„(0) = _|5„(0)|2 
fc=i 

be the Fourier transform and the periodogram, respectively. Let 6k = 27rA;/n, 
1 < A; < n, be the Fourier frequencies. Primary goals in spectral analysis in- 
clude estimating the spectral density / and deriving asymptotic distributions 
oiSn{e) and/„(0). 

We now introduce some notation. For a column vector x = (xi, . . . , Xq)' G 
R"?, let l^l = (Ej=i^j)^^^- Let ^ be a random vector. Write ^ G £p (p > 0) 
if ll^llp := [E(|^|p)]Vp < oo and let || • || = || • lb- For i e define projec- 
tion operators Vki = E(^|.Ffc) — E(^|.Ffc_i), k £ Z, where we recall J^k = 
(. . . ,£k-i,£k)- For two positive sequences (a^), (6^), denote by x bn that 
there exists a constant c such that < c < an/hn < 1/c < oo for all large 
n and by o„ ~ 6.„ that a„/6.„ ^ 1 as n ^ oo. Let C > denote a generic 
constant which may vary from line to line; let $ be the standard normal 
distribution function. Denote by convergence in distribution and by 

A^(/i,(T^) a normal distribution with mean /u and variance cj^. All asymp- 
totic statements in the paper are with respect to n — > oo unless otherwise 
specified. 

The paper is structured as follows. In Section 2 we shall establish a cen- 
tral limit theorem for the Fourier transform Sn{6) at Fourier frequencies. 
Asymptotic properties of smoothed periodogram estimates of / are discussed 
in Section 3. Section 4 shows the consistency of the frequency domain boot- 
strap approximation to sampling distributions of spectral density estimates 
for both linear and nonlinear processes. Section 5 gives sufficient conditions 
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for geometric moment contraction [see (3.1)], a basic dependence assump- 
tion used in this paper. Some examples are also presented in that section. 
Proofs are gathered in the Appendix. 

2. Fourier transforms. The periodogram is a fundamental quantity in 
frequency domain analysis. Its asymptotic analysis has a substantial his- 
tory; see, for example, [57], Theorem 5.3, page 131, for mixing processes; [8], 
Theorem 10.3.2, page 347, [63] and [70] for linear processes. Other contribu- 
tions can be found in [38, 46, 55, 71] and [77]. Recently, in a general setting, 
Wu [73] considered asymptotic distributions of Sn{0) at a fixed 9. However, 
results in [73] do not apply to Sn{0) at the Fourier frequencies. Here we 
shall show that Sn{Ok) are asymptotically independent normals under mild 
conditions; see Theorem 2.1 below. The central limit theorem is applied to 
empirical distribution functions of normalized periodogram ordinates (cf. 
Corollary 2.2). In the literature the latter problem has been mainly studied 
for i.i.d. random variables [25, 26, 36] and linear processes [12]. 

Denote the real and imaginary parts of Sn [dj )/ ^ TTnf{Oj ) by 
7 _ ELi^fcCOs(fcg,) _ ELi^fcsm(fc%) _ 

where m = rUn '■= [{n — l)/2j and [aj is the integer part of a. Let ^Ip = 
{c € R^:|c| = 1} be the unit sphere. For the set J = {ji, ■ ■ ■ ,jp} with 
1 < ji < • • • < < 2m write the vector Zj = [Zj-^ , . . . , Zj^)' . Let the class 
= { J C {1, . . . , 2m} : = p}, where is the cardinality of J. 

Theorem 2.1. Assume Xt £ C? , 

oo 

(2.1) K:=Y,\\'PoXk\\<oo 

and := min^gR f{9) > 0. Then for any fixed p G N, we have 

sup sup sup |P(ZjC < x) — <I>(2;)| = o(l) asn— >cxd. 

Theorem 2.1 asserts that the projection of any vector of p of the Zj's 
on any direction is asymptotically normal. The condition (2.1) was first 
proposed by Hannan [30]. In many situations it is easily verifiable since it 
only involves conditional moments. For generalizations see [75]. In the special 
case of linear processes Xt = Z)j^o ^j^t-j, where £j are i.i.d. with mean and 
finite variance and X^j^o'^l < oo, (2.1) becomes Z^j^ol'^jl < indicating 
that (Xn) is short-range dependent. In the literature, central limit theorems 
are established for Fourier transforms of linear processes ([21], page 63; [8], 
page 347, among others). The spectral density may be unbounded if (2.1) is 
violated. 
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Corollary 2.1. Let q^G N. Under the conditions of Theorem 2.1, we 
have 

r Sn{9i^) , 1 < J- < g j ^ { y + ^ y 1 < J- < g } 



for integers 1 < h < I2 < ■ • ■ < Iq < m, where the indices Ij may depend on n, 
and Yfc, 1 < A; < 2q, are i.i.d. standard normals. Consequently, for ln{0) ■= 

in{e)/f{e), 

{in{ei^),l<j<q}^{Ej,l<j<q}, 
where Ej are i.i.d. standard exponential random variables [exp(l)]. 

Corollary 2.1 easily follows from Theorem 2.1 via the Cramer- Wold de- 
vice. Let 

-j^ m 

i=i 

be the empirical distribution function of In{Ok) and Fe{x) := 1 — e~^, 2; > 0. 

Corollary 2.2. Under the conditions of Theorem 2.1, we have 
(2.2) sup |Fj ^(x) — — > in probability. 

x>0 

Proof. Since Fj ^ and Fe are nondecr easing, it suffices to show (2.2) 
for a fixed x. Let pj = Pj{x) = P[In{6j) < x] and pj^k = Pj,k{x) = P[/n(^j) < 
X, In{&k) < x\; let U and V , independent of the process {Xj), be i.i.d. 
uniformly distributed over {l,...,m}. By Corollary 2.1, pu ^ Fe{x) and 
Puy ~^ Fe{x)'^ almost surely. By the Lebesgue dominated convergence the- 
orem, E(pf/) Fe{x) and E(p(/y) — > Fe{x)'^ . Notice that 

m mm 

E(pj/) = m"^ ^Pj and ^{pu,v) = m'"^ ^ ^ pj^k- 
j=i j=ik=i 

So \\Fi,m(^) - FE{x)f = E{pu,v) - F|(x) + 2Fe{x){Fe{x) - E{pu)} and 
(2.2) follows. □ 

Remark 2.1. The above argument also implies that, for any integer 
k>2. 



sup 

Xi,...,Xi:>0 

in probability. 



m ^ 
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Fay and Soulier [22] obtained a functional central limit theorem for Fj ^{x) 
for i.i.d. random variables. It seems very difficult to generalize their results 
to the nonlinear case. 

3. Spectral density estimation. Given a realization (Xj)"^^, the spectral 
density / can be estimated by 



where Wn{\) is a smoothing weight function [cf. (3.2)]. Here we study asymp- 
totic properties of the smoothed periodogram estimate /„. Spectral density 
estimation is an important problem and there is a rich literature. However, 
restrictive structural conditions have been imposed in many earlier results. 
For example, Brillinger [6] assumed that all moments exist and cumulants 
of all orders are summable. Anderson [1] dealt with linear processes. Rosen- 
blatt [56] considered strong mixing processes and assumed the summability 
condition of cumulants up to the eighth order. Due to those limitations, the 
classical results cannot be directly applied to nonlinear time series. Recently, 
Chanda [11] obtained asymptotic normality of /„, for a class of nonlinear pro- 
cesses. However, it seems that his formulation does not include popular non- 
linear time series models including GARCH, EXPAR and ARMA-GARCH; 
see Section 5 for examples. 

To establish an asymptotic theory for we shall adopt the geometric- 
moment contraction (GMC) condition. Let (e^)fcGZ be an i.i.d. copy of 
(£A:)fcGz; let X'^ = G{. . . ,e'_i, eo,ei, . . . be a coupled version of X„. We 
say that X„ is GMC(q), a > 0, if there exist C > and < p = p{a) < 1 
such that, for all n E N, 



Inequality (3.1) indicates that the process (Xn) quickly "forgets" the past 
JTq = (. . . ,e_i,eo). Note that under GMC(2), \r{k)\ = 0{p^) for some p G 
(0, 1) and hence the spectral density function is infinitely many times differ- 
entiable. 

Many nonlinear time series models satisfy GMC (cf. Section 5). Moreover, 
the GMC condition provides a convenient framework for a limit theory for 
nonlinear time series; see [32, 75] and [76]. In view of those features, instead 
of the widely used strong mixing condition, we employ the GMC as an un- 
derlying assumption for our asymptotic theory of spectral density estimates. 

Let f(A;) = n~^J2^=i^ -^j^j+\k\^ \k\ < n, be the estimated covariances; 
let a(-) be an even, Lipschitz continuous function with support [—1, 1] and 
a(0) = 1; let i?„ be a sequence of positive integers with i?„ oo and Bn/n 




(3.1) 



E{\X'^-XX)<Cp^. 
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0; let 



bn = l/B, 



Wn{X) 



1 



^ a{kbn)e 
~-B„ 



—ikX 



and 




(3.2) 



/n(A) 



1 



^ r{k)a{khn)e 
~-B„ 



—ikX 




Theorem 3.1. Assume (3.1), X„ G C'^'^^ for some 6 >0, Bn—>- oo and 
S„ = o[n/ (log n) 2+8/^]. Th en 



where := a\X) = {1 + ri{2X)}f{X) j\a\t) dt and 77(A) = 1 ifX = 2k7r 
for some integer k and r]{X) = otherwise. 

Remark 3.1. The GMC has this interesting property: If Xn G CP,p > 0, 
and GMC(ao) holds for some oq > 0, then Xn is GMC(a) for all a G (0,p) 
([75], Lemma 2). 

By Remark 3.1, the moment condition X„ G C^~^^ in Theorem 3.1 together 
with GMC (a) implies GMC (4) and consequently the absolute summability 
of cumulants up to the fourth order (cf. Lemmas A.l and A. 2). In the context 
of strong mixing processes, Rosenblatt ([57], page 138) imposed Xn G>C®. 
Rosenblatt [57] also posed the problem of whether the eighth-order cumu- 
lant summability condition can be weakened to fourth order. Theorem 3.1 
partially solves the conjecture for nonlinear processes satisfying GMC under 
the moment condition X„ G C^~^^ . Additionally, Theorem 3.1 is applicable 
to a variety of nonlinear time series models (Section 5) that are not covered 
by Chanda [11]. 

Joint asymptotic distributions of spectral density estimates at different 
frequencies (cf. Corollary 3.1 below) follow from the arguments in [48], The- 
orem 5 A and [56] since GMC (4) ensures the summability of the fourth cu- 
mulants; see Lemma A. 2. 

Corollary 3.1. Let Xi,...,Xs G [0, vr] be s different frequencies. Then 
under the conditions of Theorem 3.1, \/ra6^{/„(Aj) — E(/„(Aj))}, j = 1, . . . , s, 
are jointly asymptotically independent N{0, a'^{Xj)), j = 1, . . . , s. 



(3.3) 



^{fn{X)-B{fn{X))}^NiO,a\X)), 



The problem of maximum deviation of spectral density estimates has been 
studied by Woodroofe and Van Ness [72] for linear processes and Rudzkis 
[58] for Gaussian processes. For nonlinear processes, we have: 
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Theorem 3.2. Assume (3.1), Xn G C^~^^ for some 6 G (0,4], i?„ oo, 
Bn = 0{n'>), 0<r]<6/{4 + 6) and := minn f {9) > 0. Then 

(3.4) max V^\fn{\) - E(/„(A))| = Op((logn)^/2). 

AG[0,7r] 

Under GMC(2), since ||Po-'^fc|| = C>(/"), we have (2.1). However, it is quite 
difficult to establish (3.3) under the weaker condition (2.1). Regarding (3.4), 
for linear processes the distributional result in [72] implies that the bound 
Op((logn)^/^) is optimal. We are unable to obtain a similar distributional 
result for nonlinear processes. 

For long memory processes, (1.2) is violated and / may not be well de- 
fined, so Theorems 3.1 and 3.2 are not applicable. A simple example is the 
fractionally integrated process (1 - B)'^Xj = Ej, where < d < 1/2 is the 
long memory parameter, B is the back-shift operator and £j are i.i.d. with 
mean and finite variance. Then the spectral density /(A) x |A|~^"' as A — > 
and /(O) is not well defined. In this case an important problem is to estimate 
d; see [54] and [59] and references cited therein. 

4. Frequency domain bootstrap. Here we consider bootstrap approxi- 
mations of the distribution of the lag window estimate (3.2). Bootstrapping 
in the frequency domain has received considerable attention. See [33, 45] 
and [64] for Gaussian processes and [24, 37] and [47] for linear processes. 
For nonlinear processes we adopt the residual-based bootstrap procedure 
proposed by Franke and Hardle [24]. A variant of it is discussed in Remark 
4.4. Let Ij = I{LOj), LOj = 27rj/n,j G F„ = {-[(n - l)/2j, . . . , [n/2j}. Note 
that f{k) = n~^27rJ2j£F„^j^^'"^^ • Then the lag window estimate (3.2) can 
be written as 

fnW = Y- E nkHkbr^)e~'''^ = -J2lj J2 a(fc6„,)e-^'=(^~-^). 

^ k=^B„ " j&Fn k=-B„ 

(4.1) 

The bootstrap procedure consists of the following several steps: 

1. Calculate periodogram ordinates {Ij}, j = 1, . . . , := [n/2j . 

2. Obtain an estimate f of f (e.g., a lag window estimate with band- 
width bn := B~^). 

3. Let Ej = Sj/e, where Sj = Ij/ fj, fj = fi^j) and e = J2jLi ^j- 

4. Draw i.i.d. bootstrap samples {Sj} from the empirical distribution 

of Ej. 

5. Let Ij = fjSj be the bootstrapped periodograms; let /* = Ij and 
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The rescaling treatment in step 3 avoids an unpleasant bias at the re- 
samphng stage. Setting Jq = in step 5 corresponds to the fact that, for 
a mean-corrected sample, the periodogram value is at frequency 0. The 
sampling distribution of 5n(A) = \/n6„{/„(A) — /(A)} is expected to be close 
to its bootstrap counterpart 5^ (A) = ^/nb^{f*{X) — /(A)}, where 

jeFn k=-B„ 

is the bootstrapped version of (4.1). Here we measure the closeness by Mal- 
lows' d2 metric [4]. For two probability measures Pi and P2 on R with 
/j^ |xp dPj < 00, j = 1, 2, let d2(-Pi; -P2) = inf ||li — 5^2|| , where the infimum 
is taken over all vectors {Yi,Y2) with marginal distributions Pi and P2- 
Write 

d2bn(A),5;(A)] =d2{P[5n(A) G •],P[5;(A) G • |Xi , . . . , . 

The bootstrap procedure is said to be (weakly) consistent ii d2[gnW , dnW] — 
op(l). Let C{-\Xi, . . . , Xn) denote the conditional distribution given the sam- 
ple Xi, . . .,Xn. 

It seems that in the literature the theoretical investigation of the consis- 
tency problem has been limited to linear processes. Let Xt = J2'jL-oo^j^t-j • 
Franke and Hardle [24] proved the consistency of their residual-based pro- 
cedure under the condition 

(4.2) sup{|E(e^"^i)|; |n| >6}<1 for all 6>0. 

Condition (4.2) excludes many interesting cases. For example, it is violated 
if El is a Bernoulli random variable. Franke and Hardle [24] conjectured 
that their results still hold without (4.2). The latter condition is removed 
in Corollary 4.1 of Theorem 4.1 below at the expense of the stronger eighth 
moment condition. Theorem 4.1 is also applicable to nonlinear processes; 
see Corollary 4.2. Since our results hold under various combinations of con- 
ditions, it is convenient to label the common ones: 

(Al) lima;^o 2;~^{1 — cl{x)} = C2, where C2 is a nonzero constant. 
(A2) minAg[o,^] /(A) > 0. 

(A3) max;,g[o,.] I/(A) - /(A)| = op(fen). 
(A3') max;,e[o,.] I/(A) - /(A)| = op(l). 
(A4) Efcezk(fc)|fc'<oo. 
(A4') Ek^zHmiKoo. 

(A5) Eti,...,tfe_iGz|cum(Xo,Xti,...,Xj^_J| < 00 for /c = 3,4. 

(A5') X;ti,...,tfe_iGz|cum(Xo,Xj,,...,Xf^._J| < cx) for A; = 3, . . . , 8. 

(A6) V^{/„(A)-E(A(A))}^iV(0,a2(A))andn6„var(/„(A))^a2(A). 
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Remark 4.1. Condition (Al) says that a(-) is locally quadratic at and 
it is satisfied for many lag windows. It is related to the bias. By Anderson [1], 
Theorem 9.4.3, or Priestley [52], page 459, under (Al), (A4) and = o(n), 

Bl{E{UX))-f{X)}^C2f"{X), 

(4.3) 

where /"(A) = E r{k)k^e-'^^. 

Additionally, if (A6) holds, then the optimal bandwidth is of order n~^/^ 
in the sense of mean square error. 

Remark 4.2. The cumulant summability conditions (A5) and (A5') 
are commonly imposed in spectral analysis [7, 57]. For the linear process 
Xt = ET=-oc (^j^t-j with Ej^-oo \aj\ <oo, (A5) [resp. (A5')] holds if ei e 
[resp. £i S C^]. By Lemma A.l, for the process (1.1), (A5) [resp. (A5')] 
is satisfied under GMC(4) [resp. GMC(8)]. Zhurbenko and Zuev [79] and 
Andrews [3] considered strong mixing processes. 

Let P*, E* and var* denote the conditional probability, expectation and 
variance given Xj,l<j< n; let K(A) = Vnb^{fnW - E(/„,(A))}, V'„*(A) = 
V^{/*(A)-E*/*(A)}, /3n(A) = V^{E(^(A))-/(A)} and/3;(A) = V^x 
{E*/*(A) — /(A)}. For the consistency of the bootstrap approximation, it is 
common to treat the variance and the bias separately. 

Proposition 4.1. Assume G £^ (A2), (A3), (A4'), (A5') and (A6). 
Let Bl = o{n). Then d2[K(A), ^^(A)] ^ m probability. 

Proposition 4.2. Assume Xt & C^, (Al), (A4) and (A5). Let bn = 
o{K), Bl = o{n) and Bl = o{n). Then Bl{W fl{X) - /(A)} ^ C2/"(A) in 
probability. 

Remark 4.3. The condition 6„ = o(6„) is needed to ensure the consis- 
tency of the bias; see (4.3). So /(A) is smoother than /n(A). Oversmoothing 
is a common practice in the frequency domain bootstrap [24, 37, 47]. 

Theorem 4.1. Assume Xt £ , (Al), (A4), (A5') and (A6). Let 6„ x 
n~^/^ andbn = o(bn). T/ien (i2[5'n(A),5* (A)] = op(l) and d2[5'„(A)//(A), g* (A)/ 
/(A)]=op(l). 

Proof. In the proof A is suppressed and we write gn, and so on, for 
5n(A), and so on. Since dl{gn, gt) = dl{Vn,V*) + dl{Pn, P*) ([4], Lemma 8.8), 
by Propositions 4.1, 4.2 and (4.3), d2{gn,gn) = op(l). The second assertion 
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follows similarly. By (A2), (A3) and Proposition 4.2, (3*^/1 -[in/ f = (Pn - 
fin)/f + if-^ - f-^)Pn = op(l). It remains to verify (i2(K//, V*/f) = op(l). 
By Lemma 8.3 in [4], it suffices in view of (A6) to show that var* {V* / f) — > 
a^/f and C{V* / f\Xi, . . . , Xn) ^ N{0,a'^ / f) in probability. By (A2) and 
(A3), these two assertions follow from relation (A. 21) in the proof of Propo- 
sition 4.1. □ 



Remark 4.4. Since the residuals {Ini^^j)/ f{ujj)} are asymptotically 
i.i.d. exp(l) (Corollary 2.1), a modified procedure is to replace the boot- 
strapped residuals by i.i.d. standard exponential variables. For this mod- 
ified bootstrap procedure. Theorem 4.1 holds with the assumption (A5') 
replaced by (A5) and the eighth moment condition weakened to Xt ^ C^; 
see the proof of Proposition 4.1. 

Corollary 4.1. Let Xt = J2^~oo (^j^t-j , where \ak\ = 0{\k\~^~'^), 
(3 > 1/5 and ei G . Assume (Al), (A2), (A4), 6„ x n~^/^ and K x n"''!, 
r/i G (1/10,1/5). Then the conclusions in Theorem 4.1 hold. 



Proof. By Theorem 4.1, it suffices to verify (A3), (A5') and (A6). (A6) 
follows from Theorems 9.3.4 and 9.4.1 in [1]. The assumption (A5') is sat- 
isfied under E(ef) < oo and |afc| = 0{\k\~^~f^), (3 > 1/5 (see Remark 4.2). 
Note that 

max |/(A)-/(A)| 

AG[0,7r] 

(4.4) 

< max |/(A) - E(/(A))| + max |E(/(A)) - /(A)|, 

Ae[0,7r] Ae[0,7r] 

which is of order Op{{logn)^/'^ /{nbn)^^^) + Op{bl) = op(6„) by Theorem 2.1 
in [72] and (4.3). So (A3) follows. □ 



Corollary 4.2. Let the process (1.1) satisfy GMC{8). Assume (Al), 
(A2), bn n^^/^ and bn n~'^^ , 7]2 £ (1/10, 1/5). Then the conclusions in 
Theorem 4.1 hold. 



Proof. We shah apply Theorem 4.1. By Lemma A.l, GMC(8) imphes 
(A4) and (A5'), while (A6) [resp. (A3)] follows from Theorem 3.1 [resp. 
Theorem 3.2 and (4.4)]. □ 

5. Applications. There are two popular criteria to check the stationarity 
of nonlinear time series models, drift-type conditions [10, 23, 42, 67, 68, 69] 
and contraction conditions [16, 19, 34, 76]. It turns out that contraction 
conditions typically imply CMC under some extra mild assumptions, and 
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are thus quite useful in proving limit theorems [32, 75]. In this section we 
consider nonlinear autoregressive models and present sufficient conditions 
for GMC so that our asymptotic spectral theory is applicable. 
Let £,en be i.i.d., p,d>l; let X„ S R'^ be recursively defined by 

(5.1) Xn+i = R{Xni ■ • ■ , X„_p+i; e„+i), 

where i? is a measurable function. Suitable conditions on R implies GMC. 

Theorem 5.1. Let q > and a' = min(l, a). Assume that R{yo; e) G 
for some yo and that there exist constants ai, . . . , Op > such that J2^=i < 
1 and 

(5.2) \\Riy;e)-R{y';e)t <j2aj\xj-x'jr' 

holds for all y = {xi, ... ,Xp) and y' = {x'l, . . . ,Xp). Then (i) (5.1) admits a 
stationary solution of the form (1.1) and (ii) Xn satisfies GMC{a). In par- 
ticular, if there exist functions Hj such that \R{y;e) — R{y';e)\ < J2^=i Hj{e)\xj - 

x'j\ for all y and y' and Z]j=i ll-f^j(£)lla' < 1; then we can let aj = \\Hj{£)\\'^ . 

Duflo [18] assumed a > 1 and called (5.2) the Lipschitz mixing condition. 
We allow a < 1. Similar conditions are given in [27]. 

Proof of Theorem 5.1. It follows from the arguments in [76] and 
Lemma 6.2.10 and Proposition 6.3.22 in [18]. For completeness we include 
the proof here. Without loss of generality let d = 1. Let a < 1. For y = 
(xi, . . . , Xp) G define the random map Re{y) = (-R(y, e), xi, . . . , Xp-i). Let 
Zm{y) be the first element of the vector Reo° Re^i° " '° Re-miv)-: where m 
is a nonnegative integer. By (5.2), we have for m>p that 

\\Zm{y) - Zmiy'm < Ea,||^„^-,(y) - Zm-,{y'm. 

Since ai, . . . , are nonnegative and Z]j=i Oj < 1, it is easily seen that the 
preceding relation implies that there exist constants C > and Aq G (0, 1) 
depending only on ai, . . . ,ap and a such that 

(5.3) \\zUy)-zUy')\\a<cx^\y-yT 

holds for all m > 0. See also Lemma 6.2.10 in [18]. Applying (5.3) with y = yo 
and y' = -R£_,„_i(?/o)) since Aq < 1 and a < 1, 

Coo \ oo 

Y l^m.(yo) - Zm+i{yo)\ j < Y ll^m.(yo) - -^m+i(yo)ll" 
m=0 / m=0 

oo 



<cJ2>^o\\yo-Reiyo)€<oo. 



m=0 
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So {Zm{yo)}m>o is a Cauchy sequence and it has an almost sure limit Z^o 
(say) which is in C^. Since Z^c is ^o-™easurable, we can write Z^o = G{J^o) 
for some measurable function G. By (5.3), for any y, Zm{y) converges al- 
most surely to the same limit Z^o- So we can express X„ = G{J-n), n G Z. 
Let !F'- = {. . . ,e'j_i,e'j). By stationarity, (ii) follows from (5.3) by letting 
y = (G(.7^_™_i ),..., G(.7^_^_p)) and y' = {G{J^1^_,), . . . ,G{:Fl^_p)). The 
other case a > 1 can be similarly dealt with. See Proposition 6.3.22 in [18]. 
□ 

Theorem 5.2. Let {r]t) satisfy GMC{a); let 6*1, (/>!,..., p, g G 
N, he real coefficients and the roots of the equation — J2^=i ^fcA^~^ = 
lie inside the unit circle. Then the autoregressive moving average (ARM A) 
{p, q) process Xt defined below also satisfies GMC{a) : 

Xt - OiXt^i 6pXt-p = r]t- (piVt-i 4>gr]t-g- 

Theorem 5.2 shows that the GMC property is preserved in ARMA mod- 
eling [43] and that it is an easy consequence of the representation Xt = 
J2'k^=obk'nt-k with \bk\ < Cp^ for some p € (0, 1). Min [43] considered the case 
a > 1. Theorem 5.2 imphes that the ARMA-ARCH and ARM A-G ARCH 
models [39] are GMC; see Examples 5.4 and 5.5. 

Near-epoch dependence (NED) is widely used in econometrics for central 
limit theorems [14, 15]. The process (1.1) is geometrically NED [G-NED(a)] 
on (es) in La, a > 0, if there exist C < oo and p G (0, 1) such that 

||^t-E(Xt| £t-m,£t-m+l, ■ ■ ■ iet)||a < Cp™" 

holds for all m G N. It is easily seen that, for a > 1, GMC (a) is equivalent to 
G-NED(a). In certain situations GMC is more convenient to work with; see 
Remark 5.1. Additionally, GMC has the nice property that X^ is identically 
distributed as Xt, while in NED the distribution of E{Xt\et^m, •■•,£<) typ- 
ically differs. Here we list some examples that are not covered by Davidson 
[15]. 

Example 5.1. Amplitude-dependent exponential autoregressive (EX- 
PAR) models have been studied by Jones [35]. Let Sj G be i.i.d. innova- 
tions and 

Xn = [ai + /3i exp(-aX^_i)]X„_i + 

where ai, Pi,a > are real parameters. Then Hi[e) = |ai| -|- \I3\\. By Theo- 
rem 5.1, Xn is GMC(a) if |ai| + < 1. 
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Example 5.2. Let 0i, . . . ,0^ be real parameters and consider the AR(2) 
model with ARCH(2) errors [20], 

Theorem 5.1 is applicable: we can let Hi{e) = \9i\ + Ie04| and H2{e) = \d2\ + 
\e95\. Then GMC(a), a > 0, holds if Ej=i \\Hj{e)\\^ < 1 and ei e 

Let At he p X p random matrices and -B^ be p x 1 random vectors. The 
generalized random coefficient autoregressive process (Xt) is defined by 

(5.4) Xt+i = At+iXt + Bt+i, t€Z. 

Let {At, Bt) be i.i.d. Bilinear and GARCH models fall within the framework 
of (5.4). The stationarity, geometric ergodicity and /3-mixing properties have 
been studied by Pham [50], Mokkadem [44] and Carrasco and Chen [9]. Their 
results require that innovations have a density, which is not needed in our 
setting. 

For a p X p matrix A, let \A\a = sup^^o l^-^la/lzla, a > 1, be the matrix 
norm induced by the vector norm = (E^=il-Zil°)^/". It is easily seen 
that Xt is GMC(a), a > 1, if E(|^o|a) < 1 and E(|5o|q) < oo. By Jensen's 
inequality, we have E(log|^o|a) < 0. By Theorem 1.1 of [5], 

oo 

(5.5) Xn = '^^AnAn^l- ■ ■ An-k+lBn-k 

k=Q 

converges almost surely. 

Example 5.3. Consider the subdiagonal bilinear model [28, 62] 

p q P Q 

(5.6) Xt = Yl + + mi bjkXt-j-k^t-k- 

j=l j=0 j=Ok=l 

Let s = max(p, P + q, P + Q), r = s — niax{q, Q) and Op+j = = Cg+j = 
bp+k,Q+j = 0, k,j > 1; let be a 1 X s vector with the (r + l)st element 
1 and all others 0, c be an s x 1 vector with the first r — 1 elements fol- 
lowed by 1, oi + ci, . . . , Us-r + Cs-r, and d be an s x 1 vector with the first 
r elements followed by boi, ■ ■ ■ , fco,s-r- Define the s x s matrices 



A = 



/O 1 

















1 








ai 


■•. 






1 


\as ■■■ ■ 


• • ds—r 


0/ 
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/ ••• •• 



B 





brl 





^01 



\br 



bo,. 















Let Zt he an s X 1 vector with the jth entry Xf-r+j if 1 < J < and 

r s~r ( P ^ 

akXt+j~k + ^^{ck + ^^ bikXt+j-k-i \£t+j-k 

k=j k=j I /=0 J 

if 1 + r < J < s. Pham [49, 51] gave the representation 



(5.7) 



Xt = HZt-i + et, Zt = {A + Bet)Zt^i + cet + de^ 



By (5.7), Xt is GMC(a), a > 1, if ei G £2a E(|^ + Sei|°) < 1. By (5.5), 
Zt admits a casual representation and so does Xt. 

Remark 5.1. Davidson [15] considered the bihnear model (5.6) with 
q = and Q = 1. He commented that, due to the complexity of moment ex- 
pressions, it is not easy to show G-NED(2) for general cases. In comparison, 
our argument works. 

Example 5.4. Ding, Granger and Engle [17] proposed the asymmetric 
GARCH(r, s) model 



?/2 



(5.8) Xt = et^ht, 

i=i i=i 

where ao > 0, aj > (j = 1, . . . ,r) with at least one aj > 0, (3j > (j = 
l,...,s), ? > and I7I < 1. The linear GARCH(r, s) model is a special 
case of (5.8) with <j = 2,7 = 0. Wu and Min [75] showed GMC for linear 
GARCH(r,s) models. Let Zt = (|et| -7^t)S C^t = (ao^t,0, ...,ao,0,.. . , O)'^^,^^)^-^, 
of which the (r + l)st element is ao and 

/ aiZt 



A. 



UrZt 

I(r-l)x{r-l) 0(r-l)xl 



ai 



\ 



o 



(s— 1) xr 



(iiZt 



PsZt \ 



o 



(r— l)xs 



/3i • • • 

^(s-l)x(s-l) 0(s_i)xl J 



Ling and McAleer [40] showed that Xt £ jC™*" for some m G N if and only if 
(5.9) A{E(Af™)} < 1, 

where (SD is the usual Kronecker product and A (A) is the largest eigenvalue 
of the matrix (A'A)^/^. Further, Xt admits a casual representation (1.1); see 
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Theorem 3.1 of Ling and McAleer [40]. It turns out that (5.9) also imphes 
GMC(m?). 

Proposition 5.1. For the asymmetric GARCH{r,s) model (5.8), let 
£t G <; > 1. Then Xt is GMC{m<;) if (5.9) holds. 

Proof. Let Yt = [{\Xt\--fXtY , . . . , {\Xt-r+i\-lXt-r+iY ,hf , htl]' 
Then Yf = A^tYt-i + ^qt [40]. Let yj, independent of {e^, t E Z}, be an i.i.d. 
copy of Yq. We recursively define = A^tYl_i + C?t , * > 1- Let = - y/. 
Then Yt = A^tYt-i- Applying the argument of Proposition 3 in [75], we have 

~ -^e;t -'t-1 — ■ ■ ■ — ^fi -'0 

Thus £(1;^™) = [E(^®"^)]*E(y(f'^) since A,t,...,A,i are i.i.d. By (5.9), 

< Cp^ for some p E (0,1). So E(|/i^/^ - {Ky/'^\^) is bounded 
by Cp* and 

E(lXj - X;]'"^) = E(eJ"^)E(iy)^- y^l"^) < CE(l/i^/^ - (/i^^/Y') < Cp\ 
where the inequality \a — h\''<\a''—W\, a > 0, 6 > 0, ? > 1, is applied. □ 

Example 5.5. Let £t be i.i.d. with mean and variance 1. Consider the 
signed volatility model [78] 

(5.10) Xt = £t\st\^''^, st = g{£t-.i) + c{et^i)st^i, ? > 0. 

When st = hl> 0, (5.10) reduces to the general GARCH(1, 1) model [31, 41] 

Xt = etht, hi = g{et-i) + c{£t-i)h'l_^, <j > 0. 
We shall show that the model (5.10) satisfies GMC under mild conditions. 

Proposition 5.2. For the model (5.10), /etEdei]"^) < 1, E{lc(ei)l"} < 
1 and g{ei) ^C"", a> 0. Let ? > 1. Then Xt is GMC{c;a). 

Proof. By Theorem 5.1, st is GMC(a). Since ^{{\st\^/'' - Jsj]^/'^)'^"} < 
E(lst - 41°) and Xt = et\st\^''', Xt is GMC(?a). □ 

Since E{lc(ei)l"} < 1 implies E{log lc(ei)l} < 0, by Theorem 1 of [78], 
Xt has a unique stationary solution which admits the casual representation 
(1.1). 

APPENDIX 

We now give the proofs of the results in Sections 2-4. 
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A.l. Proof of Theorem 2.1. For presentational clarity we restrict J = 
C {l,...,m} and hence Zj^ corresponds to the real parts of 
Sn{0ji)- The argument easily extends to general cases. Let 



JO 



Tn = '^HkXk, where //^ = i^k (c, J) = ^ 

k=i 1=1 xh'fiGji) 



l<k<n. 



Since /* := minR/(0) > 0, there exists ;U* such that < for all c£^lp 
Let dnih) = Efc=i+h fJ-kfJ'k-h ifO</i<n-l and = 



and J G i 
if h> n. Note that 



^cos( 

k=l 



n 



cos[{k + h)9j ] = - cos(/i6'jJl 



ji)-'-ji=Jii ■ 



Then it is easily seen that there exists a constant Kq > such that for all 
/i> 0, 



Tn{h) = sup sup 



< 



Koh 



n 



Clearly r„(/i) < + (27r/*)-i =: Ki. So we have uniformly over J and c 
that 



(A.l) 



II nil ^ 






n 





d„(0)r(0)+2^d„(/i)r(/i)-l 

h=l 



< 2 ^ r„(/i)r(/i) < ^ K2 min(/i/n, l)r(h) 
/i=0 /i=0 



by the Lebesgue dominated convergence theorem, where K2 = 2{Kq + Ki). 

Let fn = Efc=i Mfc-^fc, where = E(Xfc|efc_^+i^, . . . , e/c) are ^-dependent. 
So 5i = \\Xq — Xq\\ ^ as £ ^ 00. If /c < thenPo-'^fc = E(T'o-'^fc|efc-£+i) • • • ,^0) 
By Jensen's inequality ||Po-^fc|| ^ ll'^o-'^fcll- If A; > then VoXk = 0. Clearly 
\\Vo{Xk — Xk)\\ < 26i. By the Lebesgue dominated convergence theorem, 
(2.1) entails that 

n 1 1/2 00 



T — T 

I n -'- n I 



(A.2) 



j=-oo 
00 



<fi.Y.\\^o{Xk-Xk) 

k=0 



<fi,^2mm{\\VoXkl< 

fe=0 



0. 



Let gn{r) = r'^B[X^l{\X\ > V^/r)]. Since E(X2) < 00, lim„^oo5n(r) = 
for any fixed r > 0. Note that gn is nondecreasing in r. Then there ex- 
ists a sequence r„ t cxo such that Qnirn) 0. Let = Xfcl(|Xfc| < \/nlrr^ 
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and Tn,Y = Efc=iMfe^fc- Then HY^ - Xk\\ = o(l/r„). Since Yk - Xk are i- 
dependent, 



(A.3) ||r„,y-T„||<^ 



a=l 



b<n,e\{b-a) 



o{y/n/rn), 



where i\h means that £ is a divisor of h. Let pn = \_'fj/'^\ and blocks Bf = 
{a£-N:l + {t-l){pn+i)<a<Pn + {t- + i)}, 1 < t < t„ := [1 + 

{n - Pn)/ [Pn + t)\ . Define Ut = J2aeBt f^aYa, K = J2t=i Ut, Rn = Tn,Y - 

Vn, W = {Vn- E(K))/VH and A = r„/^ - W. Then Ut are indepen- 
dent and ll-Rnll = O(y^) since Ya are ^-dependent. Note that |E(T4i)| = 
0{n)\E{Yu)\=o{^/rn). Then by (A.3), 



(A.4) 



V^||A|| < |E(K)| + \\Vn - fnW = o{^/rn) + O {./T;, + / T n) 

= 0{y/t^). 



Since \Ut\^ < filplEa^B, l^aP and E(y,2) < B{Xl),B{\Ut\') = 0(p3^/r„). 
By the Berry-Esseen theorem ([13], page 304), 



snp\P{W<x)-^x/\\W\\)\<Cj2^{\Utf) X ||K-E(K)ir 



(A.5) 



t=i 



= 0{tnplV^/rn) X n-3/2 = 0{p-^). 
Let 5 = 5n=Pn^'^. By (A.4), (A.5) and 

Y{W <w-5)- P(|A| >5)< Y{W + A < u;) 

<P(VF<w + 5)+P(|A| >5), 



(A.6) 



we have sup^ |P(f„ < ^x) - $(V^x/||f'„||)| = 0[p-2 + P(|A| >5) + 5 + 
5"^] = 0(5) since sup^. |<I>(x/cJi) — (t{x/a2)\ < C|o"i/o"2 — 1| holds for some 
constant C. 

Let Wi = fn/V^, Ai = (T„ - fn)/^ and = = (||r„ - f„||/^)i/2. 
We apply (A.6) with VF, A replaced by Wi,Ai, 



sup 



n- ) \\\Tn\ 



^Jnx 



:0(P(|Ai| >7?) + 5 + r/ + 772). 



So the conclusion follows from (A.l) and (A. 2) by first letting n — > oo and 
then i oo. 



18 



X. SHAO AND W. B. WU 



A. 2. Proof of Theorem 3.1. The foHowing two lemmas are needed. 

Lemma A.l ([76]). Assume (3.1) with a = k for some A; G N. Then there 
exists a constant C > such that for all < mi < • • • < rrik-i, 

I cum(Xo, X^, , . . . , J I < . 
Lemma A. 2. Let the sequence G N satisfy Sn<n and Bn = o{sn); let 

(A.7) y„:=y„(A) = (2^)-i ^ X^X^+ka{khn) cos{kX) . 

k = — Bn 

Then under GMC{A) we have \\ J2i=i{Yu -B{Yu)}f Sn^n^r^ 

Proof. Let L(s) = {(mi, m2, ma) G : maxi<j<3 \mi\ = s} and c(mi, m2, 
m-s) = cum(XQ,Xmi,Xm2,Xm3)- So #L{s) < 6(2s + 1)^. By Lemma A.l, we 
have 

oo 

|c(mi,m2,m3)| < ^ |c(mi, m2, ms)] 

r?ii,m2,m3£Z s=0 (mi,r?i2,m3)S-L(s) 

oo 

<c£sV/[4(4-^)]<oo. 

s=0 

See also Remark 3 in [76]. Then the lemma follows from equations (3.9)- 
(3.12) in [56], page 1174. □ 

Proof of Theorem 3.1. Let p = p{A), at = a{khn) cos{k\) and 

/ B„ n —In \ 

ivrVra-On \fc^Ou=n-A:+l k=-B„u=n+k+l I 

By the summability of cumulants of orders 2 and 4 (cf. [57], page 139), 
\\hn{\)\\ = {nBn)~^^'^0{Bn). Recall (A.7) for the definition of and let 
gn:=9n{\) = TZ=iYu{\). Then 

(A.8) V^n{fn{\) - E(/n(A))} = ^" "^"^ + hn{\) " E(/i„(A)). 

For A; G Z let Xk = 'Ei{Xk\ek-i+i, ■ ■ ■ ,£k), where l = ln = [clognj and c = 
— 8/log/9. Let Yu := Yu{X) be the corresponding sum with Xk replaced by 
Xk- Observe that Xn and X^ are i.i.d. if |n — m| >l and Y^ and Y^ are i.i.d. if 
|m — "Lil > 2Bn + /. The independence plays an important role in establishing 
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the asymptotic normality of gn ■= 5n(A) = J2u=i YuW- Then \\gn - gn\ 
0(1) since 



(A.9) \\Yu-Y^\\<i2TTy^ \\XuXu+k-XA+k\\\ak\=0{Bnp'^^). 



k = -Bn 



(A.IO) 



Let = n/(logn)2+8/^ p„ = [^^/^S^^J and g„ = LV^^^^n^^J • Then 

Pn,qn ^00, qn = o{pn), 

2Bn + I = o{qn) and kn = [n/ {pn + qn)\ 00. 

Define the blocks Cr = {j G N : (r - + g.„) + 1 < j < r{qn+Pn) - ^n}, 

1 < ?■ < 5r. = {i G N : r(p„ + g'n) - + 1 < i < '^(o'n +Pn)}, 1 < <K - 1 
and cSfc„ = {j G N : fe„(p.„ + - + 1 < j < n}. Let f/,. = Eje£r ^^^"^ 
= Z^jeSr r Observe that f/i, . . . , U^^ are i.i.d. and Fl, . . . , Vfc^_i are also 
i.i.d. By Lemma A. 2 and (A.9), 



||C/i-E(C/i)|| 



(A.ll) 



^{y,-E(yo)} 



+ 0(Pn||yO-yo||) 



Similarly, H^i -E(^i)|| ~ (g„S„c72)i/2 + o(g„5„p'/4). By (A.IO), 

var(yi + • • • + I4J = - 1)11^1 - E(yi)f + \\Vu^ - E(yfcjf 

= 0{knqnBn) + 0[(Pn + Q'ri)-Bn] = o{nBn). 

Then we have (nB„)-V2{^^ _ E(5„)} ^ iV(0,cj2) if 
(A.12) (ni?J-i/2 ^{[/^ _ E(C/i)} ^ Af(0, a^). 

r=l 

Let T = 2 + 5/2. Case (i) [logn = o(i?„)]. By the triangle and Rosenthal 
inequalities 

/ \ip^-h)/i\ -I 

j=l k=-Bn 



Pn -i 
M=l k = — Bn 



<E 



<0(/)./p„// 



i-1 



<0(v^)E 



/i=0 



L{B„-i-h)//J 

E X:_B„+h+jia-Bn+h+jl 
j=0 



o[{VpJ)i^/bji] = oiVpAi). 
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On the other hand, since Xh+3jiXh+3ji+k, < j < [{pn - h)/ (3/)J , are i.i.d., 



Pn 

u=lk=l-l 



< E 



Pn 



(A.13) 



3i 

< E E 

k=l-lh=l 



l{pn-h)/{3l)] 

i=o 



Then we have ||J7i ||t = 0{\/pnBnl + P\/pn/l) = 0{\/pnBnl). Case (ii) [i?,. 



O(logn)]. By the argument of (A.13), ||C/i||^ = OiBJy%Jl)=0{y/pAl)- 
It is easily seen that the 0(-)-relation holds uniformly over A G [0,7r], that 
is, 



(A.14) 



sup \\Ui{X)\\r = 0{lVPr^n 

Ae[0,7r] 



Then \\Ui — E(C/i)||^ = o[{nBn)^/'^kn^^'^] and the Liapounov condition holds. 
By the central limit theorem and (A. 11), we have (A. 12). So (3.3) follows 
from (A.8). □ 

A.3. Proof of Theorem 3.2. We adopt the block method. Let Ur{X), r = 
1, . . . , be i.i.d. block sums with block length p = Pn= [n^~^''/^(logn)~^/^~^J 
and V^(A), r = 1, . . . , A;„ — 1, be i.i.d. block sums with the same block length 
Q = Qn = Pn- The last block Vfc„(A) is negligible. Note that Bn = o{pn) since 
77 < (5/(4 + 5). Let I = ln= [-81ogn/log/)(4)J as in the proof of Theo- 
rem 3.1. Define Ur{\)' := Ur{\)x l(|C/r(A)| < dn) for r = l,...,kn, where 
dn = [V ni?„,(log n)""*"/^] . The following lemma is needed. 



Lemma A. 3. Under the assumptions in Theorem 3.2, we have 



(A.15) 

(A.16) 
(A.17) 
(A.18) 



E max |T4„(A)| =0(v/pn/i3n), 

\Ae[0,7r] / 

Ef max \hn{\)\] = o(l), 

VAe[0,7r] / 

max var(C/i(A)) = 0{pnBn), 

Ae[0,7r] 

var([/i(A)') = var(?7i(A))[l + 0(1)], 



where the relation o(l) in (A.18) holds uniformly over [0,7r]. 
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Proof. Let z = kn{p + q) + l-q and r = 2 + 6/2. Then 



E( max |V1,„(A)| 

vAG[0,7r] 



< 



For |j| < /, II X]u=^^u^u+j|| = 0{-\/pnl) since X^X^+j is 2Z-dependent. When 
IjI > ^ II E"=^^«^«+if = llu,u'=z ^{XuXu+jX^'X^'+j) = 0{pnl) since the 
sum vanishes if |n — ti'| > I. So Emax;^g[o,7r] l^fcn(-^)l — 0{\/p^Bn). Let hn{X) 
be the corresponding sum of hn{X) with XuX^^^ replaced by XuX^+k- As at 
(A. 9), we have Emax;^g[Q \hn{X) — hn{X)\ = o(l). To show (A. 16), it suffices 

to show Emax;^g[o,7r] l^n(A)| = o(l) which follows from a similar argument 
as in the proof of (A. 15). Regarding (A. 17), we have 



var(C/i(A)) 



B„ 



J2 E {XuXu+k-r{k)}ak 



u=l k= 



Bn 



E E {r{u — u')r{u — u' + k — k') 



,,u'=lk,k'= 



'Bn 



+ r{u' — n + k')r{u' —u — k) 

+ cum(Xo, Xk, Xu'-u, Xu'-u+k')}akOik' 



Then Ii is bounded by C J2 



h+h + h- 

p-i 



2Bn 



:2S„ + l-|<7|)|r(/i + 



g)\, which is less than Cp{2Bn + l){J2'k^=~oo k(^)l)^- Similarly, smaller bounds 
can be obtained for I2 and I3 due to the summability of the second and 
fourth cumulants. Thus max;^gjo n] var(?7i(A)) = 0{pnBn)- For (A. 18), let v = 
var{C/i(A)-[/i(A)'} and c = E([/i(A)')E{[/i(A) - [/i(A)'}. Then var(C/i(A)') = 
var([/i(A)) - u + 2c. By Markov's inequality and (A.14), v < \\Ui{X)\\l/d^-~^ = 
o{pnBn) and similarly c < ||C/i(A)||^~'^^/(i^~-^ = o{pnBn)- By Lemma A. 2 and 
since / is everywhere positive, (A. 18) follows. □ 

Proof of Theorem 3.2. Let Hn{X) = J2r=iPr{X)-E{Ur{X)}], /7„,(A)' : 
E^=i[C/r(A)'-E{C/,(A)'}]. Let Xj = nj /t^J = 0, . . . ,tn, t„ = [^^ log(5„)J . 
Let Cn = 1/(1 -37r/ log 5„) ^ 1. By Corollary 2.1 in [72], maxAg[o,^] |^^n(A)| < 
Cnmaxj<(„ \Hn{Xj)\. By (A. 17) and (A. 18), there exists a constant Ci > 1 
such that max;^g[o,7r] var(C/i(A)') < CiPnBn- Let a„ = (Cini?„logn)^/^. By 
Bernstein's inequality, we have 

<EP(l^n(A,)'| >4a„) 

j=0 

..2 



P( max |i/„(A,)'| > 4a„, 

\0<j<tn 



0{tn) exp 



2knCiPnBn + 16(i 



0(1). 
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Let Ur{X)" = Ur{X) - UriXy and if„(A)" = Hn{X) - Hn{X)' . Then by Markov's 
inequality and (A. 14), 

tn 



pf max \Hn{X,)"\>Aar^ <Y.^{\Hn{Xj)"\>Aan) 

\0<J<tn J 'ZTc^ 



j=0 

t 

<E 



160-2 n'2r/'r-2 



o 



{Bn log n) {n/pn){y/PnBnlog uY 



, {nBn log n) {nBnY/'^-^ (log n)-^/2+i , 

= 0((logn)-^/4) = o(l). 

So max;i,g[o,7r] l-f^n(A)| = Op(an)- Clearly the same bound also holds for the 
sum E'=lVr(A)-E{K,(A)}]. By (A.9), Emax;,g[o,,] |5n(A) -5n(A)| = o(l). 
By (A.15), (A.16) and (A.8), we have (3.4). □ 

A.4. Proof of Propositions 4.1 and 4.2. Let 5j^k = lj=fc- 

Lemma A. 4. Letm= [(n-l)/2j. (i) Assume (A4'), (A5') andXt^C^. 
T/ienmaXj_fc<m|cov(/?,/|)-4/|(5j_fc| =0(l/n). (ii) Assume (A4'), (A5) and 
e T/ien maxj_fc<m |cov(/,,4) - /|5j,fc| = 0(l/n). 



Proof. We only show (i) since (ii) can be handled similarly. Note that 
cov(/|,l|) 



ti,s,G{l,...,n},Z=l,...,4 



e 



«(<l-<2+<3-i4)Aj-i(si-S2+S3-S4)Afc 



X cov(Xt,Xf,Xt3Xt„X,,X,,X,3X 



S4. 



By Theorem II. 2 in [57], we have 

coY{Xt^Xt^Xt.^Xt^,Xs^Xs.,Xs.^XsY) = ^cum(Xi^.;ij e ui) • • •cum(Xi^;ij G 

where X)?; is over all indecomposable partitions u = U • • • U of the two- 
way table 

Xt,{+) XtY-) XtY+) XtY-) 
Xs,{-) Xs,{+) XsY-) XsY+)- 

The signs in the above table are from the exponential terms in the sum 
(A. 19). Since 'Ei{Xt) = 0, only partitions v with ^Vj > 1 for all j contribute. 
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One of the many indecomposable partitions consisting only of pairs with + 
in t matched to — in s [say, {t2,S2), (^3,53), (^4,54)}] leads to the 

sum [A(Aj,Afc)]^, where 

1 " 

^(A,,Afc) = — J2 r{h- si)e''^^^-''^^^ = f{X,)l,=k + 0{l/n). 

tl,Sl=l 

The other indecomposable partitions consisting entirely of pairs (with + in t 
matched to - in s) are {(ti,S3), (t2,S2), {t3,si), {ti,S4)}, {(ti,si), (t2,S4), {^,33), 
(ti, S2)} and {(ti, S3), (i2) ■S4), (t3, si), (t4, 52)}- It is easily seen after some cal- 
culations that partitions containing entirely pairs but with at least one + in 
t matched to one + in s result in a term of order 0(l/n) for any j,k. All 
other partitions that are not all pairs will give a quantity of order 0(l/n) 
due to the summability of cumulants up to the eighth order. Finally, it is 
not hard to see that 0{l/n) does not depend on (j, k). Thus the conclusion 
is proved. □ 

Lemma A. 5. Assume Xt e £^ , (A2), (A3'), (A4') and (A5'). Then 
var*(e^) I in probability and E*(|eJ|'^) = Op(l). 

Proof. By (A3'), / is a uniformly consistent estimate of /. It remains 
to show 

1 ^ /. 1^/2 

— ^-^^1, j^Y.-h^'^ in probability and 

j=i j=i Jj 



(A.20) 



1 ^ /4 
7=1 •' J 



J 



By Proposition 10.3.1 in [8] and Lemma A. 4, we have E(/j) = fj + o(l) and 
E(/|) = 2fJ + 0(1) uniformly in j. Thus the first two assertions follow from 
Lemma A. 4 since their variances go to as n —> 00. By Lemma A.4, E{lf) = 
cov{Ij,Ij) + (E/|)2 = 8fj + 0(1) uniformly in j, and the last assertion 
holds. □ 



Remark A.l. For linear processes, Franke and Hardle [24] remarked 
that their consistency result strongly depends on the asymptotic normality 
of fn and the weak convergence of Fj ^(x) (see Corollary 2.2). The latter 
condition holds under ei £ and (4.2) by Chen and Hannan [12]. Franke 
and Hardle [24] further conjectured that their results hold assuming only 
ei G under which the weak convergence of Fj ^{x) might be true. How- 
ever, it seems from our argument (see the proof of Proposition 4.1) that it is 
not the weak convergence of Fj ^{x) but the first two conditions in (A.20) 
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that play key roles; compare Proposition Al in [24]. The proof of the second 
assertion in (A. 20) (see Lemmas A. 4 and A. 5) in a general setting needs the 
stronger eighth moment assumption. 

Let a V 6 = max(a, b) and af\b = min(a, h); let f2{k) = J^'" p{\)e'-^^ dX, 
r2ik) = J^''fWe'''dX, r{k)=Io^me'''dX and F+ = {I, . . . ,[n/2\}. 
By (A3), maxfcgz lUk) - r2{k)\ < 27rmaxA \P{X) - f{X)\ = op(6„). 

Proof of Proposition 4.1. By Lemma 8.3 of [4], the convergence 
under the d2 metric is equivalent to weak convergence and convergence of 
the first two moments. By (A6), it suffices to show that 

(A.21) nbnYav*if:{X))^a\X), C{V:{X)\{X,}]^,) ^ N{0,a\X)) 

in probability. 

Let Aj = Y.k=~B,, a{kbn)e-'^^{e'^'^^ + e-'^'^i). Since the resampled resid- 
uals {e^} are i.i.d. given we have var*(/|) = /Jvar*(eJ), and, 
since Iq =0, n6„ var*(/*(A)) = yqi* {e\)Rn{X) +op(l), where 

= E a(fcfen)a(A:'6„)e-^(^-^'') 

Ic ^fc — ^^^^ 

X /|{e*^^('=-'=') + e^'^^('=+^'')} + op(l) 
J2 a{kbn)a{k'bn)e-'^^''-^''^ 



nBn 



X {r2ik - k') + r2{k + k')} + op(l) 



-1 Bn 

— J2 a{kbn)aik'bn)e-'^^'-''^ 



k,k'=-Bn 

X {r2{k - k') + r2{k + k')} + op(l) 
= i?«(A)+42)(A)+op(i) (say). 

Let I3n{k) = R';^\x)e'^^ dX and /3(fc) = /^f^ S\ a^{u)p{X)e'^^ dudX. Then 
/?n(A:) = ^ a{jbn)a{{j-k)bn)^r2{k) a\u)du. 
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Since |/3n(^)| < C\r2{k)\ and X]fcezk2(^)| < c«, by the Lebesgue domi- 
nated convergence theorem, R^n\x) ^ f{X) j\a^{u) du. For i?i^^(A),A/ 
0, zb-TT, we have 



" h=-2Bn fc=-B„+OV/i 

— Yl r2{h)e^'^'O{l)^0. 



It is easily seen that i?i^^(A) = Rn\x) when A = 0, ivr. Hence by Lemma 
A. 5, n6„ var*(/*(A)) ^a^(A) in probabihty. 

Finally, since {e*} are i.i.d. conditional on {Xi, . . . ,Xn}, by the Berry- 
Esseen theorem and Lemma A. 5, we have 



sup 



< 



n6„var*(/*(A)), 



which implies C{V*{X)\Xi, ...,X„)^ N{0,a'^{X)) in probability since Bz = 
o(n) and sup^. \^{x/ai) — ^{x/a2)\ < C\ai/a2 — 1| for some constant C. □ 

Proof of Proposition 4.2. Since = o(n) and f{k) = a{kbn)f{k), 
\k\ < Bn and otherwise, we have Bl[W f*{>^) - /(A)] = J„(A) + op(l), 
where 

•^nW = Tr^ E a(A:Mr(A;)e-*^'^(a(A;6„)-l). 

ZTT ^ — 

It remains to show E(J„(A)) ^ C2/"(A) and var(J„(A)) ^ 0. By (Al), (A4) 
and (A5), 

732 Bn, 

E(^(A)) = ^ E a(A:Me-^^^(l-|fc|/n)r(A:)(a(fc6„)-l) 

k = -Br, 
d2 

= E a(A:6„)e-^'=V(A;)fe262c2(l + o(l))^C2r(A) 
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and 

var(J„(A)) = ^ ^ a{kK)a{k'K){a{kbn)-l){a{k'hn)-l) 

X e-^('=-'=')^cov(f(fc),f(/fc')) 
(A.22) =il±4iM V a{kK)a{k'K)k^k'^e-^^'^-^')^ 

n— |fc| »i— |fc'| 

X 51 XI cov(XtXi+|fc|,Xt'Xt,+|fc/|) 

-Bn n—kn—k' 

= OiBt/n^) 5] 5] |cov(X^X^+fc,X^,X^,+fcOI• 
fc,fc'=o t=i t'=i 

Note that cov(XtXi+fc, Xi/X^z+fc/) = r{t - t')r{t + k - t' - k') + r{t - t' - 
k')r(t' — t — k)+cum{Xt,Xt^li.,Xti,Xti^l.i). The contribution of the first term 

r{t - t')r{t + k-t'-k') to (A.22) is 0{Bl/n) E^I-b^ T.f=-2n \r{h)r{h + 

s)\ = 0{B\ln) = o(l) since J2kez < Similarly, the contribution of 
the second term to (A.22) approaches zero as n — > oo. The third term is 
0{B^/n) = o(l) due to the summability of the fourth cumulants. □ 
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